A symbolic data-driven technique based on evolutionary polynomial regression
Authors
Orazio Giustolisi (corresponding author), Faculty of Engineering, Department of Civil and Environmental Engineering, Technical University of Bari, via Turismo 8, Q. re Paolo VI, 74100, Taranto, Italy. Tel: +39 080 596 4214. E-mail: [email protected]
Dragan A. Savic, Centre for Water Systems, Department of Engineering, School of Engineering, Computer Science and Mathematics, University of Exeter, North Park Road, Exeter EX4 4QF, UK. Tel: +44 1392 263637. E-mail: [email protected]
Abstract
This paper describes a new hybrid regression method that combines the best features of conventional numerical regression techniques with the genetic programming (GP) symbolic regression technique. The key idea is to use an evolutionary computing methodology to search for the structure of a model of the system or process being modelled, and to use least-squares parameter estimation to obtain its constants. The new technique, termed Evolutionary Polynomial Regression (EPR), overcomes shortcomings of the GP process, such as computational performance, the number of evolutionary parameters to tune and the complexity of the symbolic models. Similarly, it alleviates issues arising from numerical regression, including difficulties in using physical insight and overfitting problems. This paper demonstrates that EPR performs well both in interpolating data and in scientific knowledge discovery. As an illustration, EPR is used to identify polynomial formulæ from data with progressively increasing levels of noise, to interpolate the Colebrook-White formula for a pipe resistance coefficient and to discover a formula for a resistance coefficient from experimental data.
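The two-stage idea in the abstract (an evolutionary search over model structures, with constants fitted by least squares) can be illustrated with a minimal sketch. This is not the authors' implementation: the exponent set EXPONENTS, the mutation-only search, the population settings and all function names are assumptions chosen for brevity, and the test data are synthetic.

```python
import random

# Assumed, illustrative set of exponents each input may take in a term.
EXPONENTS = [0, 1, 2]

def design_matrix(xs, terms):
    """One row per sample: a bias of 1, then one monomial prod(x_i ** e_i) per term."""
    rows = []
    for x in xs:
        row = [1.0]
        for exps in terms:
            value = 1.0
            for xi, e in zip(x, exps):
                value *= xi ** e
            row.append(value)
        rows.append(row)
    return rows

def least_squares(A, y):
    """Solve the normal equations (A^T A) c = A^T y by Gauss-Jordan elimination."""
    n = len(A[0])
    M = [[sum(r[i] * r[j] for r in A) for j in range(n)]
         + [sum(r[i] * yi for r, yi in zip(A, y))] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            return None  # numerically singular structure, e.g. duplicate terms
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

def sse(terms, xs, y):
    """Fit the constants for a given structure; return (sum of squared errors, constants)."""
    A = design_matrix(xs, terms)
    coeffs = least_squares(A, y)
    if coeffs is None:
        return float("inf"), None
    err = sum((sum(c * a for c, a in zip(coeffs, row)) - yi) ** 2
              for row, yi in zip(A, y))
    return err, coeffs

def evolve(xs, y, n_terms=2, pop_size=20, generations=40, seed=0):
    """Mutation-only evolutionary search over exponent structures (a simplification)."""
    rng = random.Random(seed)
    n_inputs = len(xs[0])

    def random_term():
        return tuple(rng.choice(EXPONENTS) for _ in range(n_inputs))

    population = [tuple(random_term() for _ in range(n_terms))
                  for _ in range(pop_size)]
    best = None
    for _ in range(generations):
        scored = sorted(((sse(ind, xs, y)[0], ind) for ind in population),
                        key=lambda t: t[0])
        if best is None or scored[0][0] < best[0]:
            best = scored[0]
        parents = [ind for _, ind in scored[:pop_size // 2]]  # keep the better half
        children = []
        for p in parents:
            child = list(p)
            child[rng.randrange(n_terms)] = random_term()  # replace one term at random
            children.append(tuple(child))
        population = parents + children
    err, coeffs = sse(best[1], xs, y)
    return best[1], coeffs, err

# Synthetic, noise-free data from y = 3 + 2*x1*x2 + 0.5*x1**2 (an assumed test function).
xs = [(float(a), float(b)) for a in range(1, 5) for b in range(1, 5)]
y = [3.0 + 2.0 * a * b + 0.5 * a ** 2 for a, b in xs]

structure, constants, error = evolve(xs, y)
print("exponents per term:", structure)
print("constants (bias first):", constants)
print("sum of squared errors:", error)
```

Each candidate here is only a table of exponents; the constants are never evolved, which is the division of labour the abstract describes. A fuller treatment would add crossover, a wider exponent range and a penalty on model complexity.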
Similar resources
APPLICATION OF EVOLUTIONARY POLYNOMIAL REGRESSION IN ULTRAFILTRATION SYSTEMS CONSIDERING THE EFFECT OF DIFFERENT PARAMETERS ON OILY WASTEWATER TREATMENT
In the present work, the effects of operating conditions including pH, transmembrane pressure, oil concentration, and temperature on fouling resistance and the rejection of turbidity for a polymeric membrane in an ultrafiltration system of wastewater treatment were studied. A new modeling technique called evolutionary polynomial regression (EPR) was investigated. EPR is a method based on regres...
Advances in Data-driven Analyses and Modelling Using EPR-MOGA
Evolutionary Polynomial Regression (EPR) is a recently developed hybrid regression method that combines the best features of conventional numerical regression techniques with the genetic programming/symbolic regression technique. The original version of EPR works with formulae based on true or pseudo-polynomial expressions using a single-objective genetic algorithm. Therefore, to obtain a set o...
Data Mining for Management and Rehabilitation of Water Systems: The Evolutionary Polynomial Regression Approach
Risk-based management and rehabilitation of water distribution systems requires that company asset data are collected and also that a methodology is available to efficiently extract information from data. The process of extracting useful information from data is called knowledge discovery and at its core is data mining. This automated analysis of large or complex datasets is performed to determ...
Investigating the Role of Factors Affecting the Frequency of Incidents in Water Supply Mains Using a Hybrid Regression Model
A water distribution network is one of the important parts of infrastructure systems. The efficient management and proactive planning of capital investment of these assets are fundamental for efficient and effective service delivered by water companies. The direct economic costs (i.e. rehabilitation investment, repair costs, water loss, etc.) as well as indirect costs (i.e. service and traffic ...
Shuffled Frog-Leaping Programming for Solving Regression Problems
There are various automatic programming models inspired by evolutionary computation techniques. Because it is important to devise an automatic mechanism for exploring the complicated search space of mathematical problems where numerical methods fail, evolutionary computation is widely studied and applied to solve real-world problems. One of the famous algorithms in optimization problems is shuffl...